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Q-l Abstract 

We consider the problem of fc-colouring a random r-uniform hypergraph with n vertices 
f**". . and cn edges, where k, r, c remain constant as n — > oo. Achlioptas and Naor showed that the 

chromatic number of a random graph in this setting, the case r = 2, must have one of two 
easily computable values as n — > oo. We give a complete generalisation of this result to random 
uniform hypergraphs. 

</5 . 1 Introduction 

! We study the problem of /c-colouring a random r-uniform hypergraph with n vertices and cn edges, 

J> ' where k, r and c are considered to be constant as n — > oo. We generalise a theorem of Achlioptas 

and Naor [3] for fc-colouring a random graph (2-uniform hypergraph) on n vertices. 

Their theorem specifies the two possible values for the chromatic number of the random graph as 
n — > oo. We give a complete generalisation of the result of [4]. We broadly follow the approach of 
Achlioptas and Naor [3], although they rely on simplifications which are available only in the case 



o 



OO 

| r = 2. We show that these simplifications can be replaced by more general techniques, valid for all 

k, r > 2 except k = r = 2. 

There is an extensive literature on this problem in the case r = 2, colouring random graphs. In the 
^ ■ setting we consider here, this culminates with the results of Achlioptas and Naor [4], though these 

do not give a complete answer to the problem. Our results here include those of [1]. 

There is also a literature for the case k = 2, random hypergraph 2-colouring. Achlioptas, Kim, Kriv- 
elevich and Tetali [2] gave a constructive approach, but their results were substantially improved 
by Achlioptas and Moore [3], using non-constructive methods. The results of [3] are asymptotic 
in r. Our results here include those of [3j, but we also give a non-asymptotic treatment. Recently, 
Coja-Oghlan and Zdeborova [5 J have given a small qualitative improvement of the result of [3], 
which goes beyond what can be proved here. See these papers, and their references, for further 
information. 
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Finally, we note that Krivelevich and Sudakov [12] studied a wide range of random hypergraph 
colouring problems, and some of their results were recently improved by Kupavskii and Sha- 
banov [13]. But, in the setting of this paper, these results are much less precise than those we 
establish here. 



1.1 Hypergraphs 

Let [n] = {1,2,..., n}. Unless otherwise stated, the asymptotic results in this paper are as n — > oo. 
We consider the set Q(n, r, m) of r-uniform hypergraphs on the vertex set n and with m edges. 
Such a hypergraph is defined by its edge set £, which consists of m distinct r-subsets of n. Let 
N = (™) denote the total number of r-subsets. 

The standard model of a random hypergraph is one chosen uniformly at random from Q(n,r,m). 
Its edge set £ may be viewed as a sample of size m chosen uniformly, without replacement, from 
the N possible edges. We will write G G Q(n,r,m) for a hypergraph G generated in this fashion. 
However, it is convenient to work with the following simpler random hypergraph model G*(n, r, m). 

Let Q*(n, r, m) be the set of all r-uniform multi-hypergraphs on [n], defined as follows: each element 
of f2*(n, r, m) consists of vertex set [n] and a multiset of edges, where each edge is now a multiset 
of r vertices (not necessarily distinct). We can generate a random element of G of Q*(n,r,m) 
using the following simple procedure: choose v = (t>i,t>2, • • • ,v rm ) G [n] rm uniformly at random 
and let the edge multiset of G be {ei, . . . , e m }, where ei = {v r (i-l)+l> • • • i v n} for i G [m]. Write 
G £ Q* (n, r, m) for a hypergraph G generated in this fashion. 

Although we will refer to it as an r-uniform hypergraph, observe that G 6 SI* (n, r, m) does not 
necessarily satisfy the definition given above, for two reasons. First, an edge of G may contain 
repeated vertices, which Q(n,r,m) does not permit. We call such an edge defective. Second, an 
edge of G £ 0*(n,r, m) may be identical to some other edge, which again f2(n, r, m) does not 
permit. We call such an edge a duplicate. Note that Q* (n, r, m) is not the uniform probability 
space over fT (n,r, m), but that all G without defective or duplicate edges are equiprobable. Thus 
Q* (n, r, m), conditional on there being no defective or duplicate edges, is identical to Q(n,r,m). 

If E n is a sequence of events, we say that E n occurs "asymptotically almost surely" (a.a.s.) if 
Vx(E n ) — > 1 as n — > oo. In this paper, the event E n usually concerns G G Q*(n,r,m), where 
m(n) = [cn\, for some constant c. We write this as G € Q*(n,r,cn), and similarly for the other 
models we consider. Note that lim n _ s>00 m(n)/n — > c when m(n) = [en J . In fact, any function m(n) 
such that m(n)/n — > c would suffice for our purposes, though we will not use this. 

If G G G*(n, r, cn), for any edge e G £, let pdef = Pr(e is defective). Then we have 

n(n — 1) ■ ■ ■ (n — r + 1) / r(r — 1) / 1 \\ r(r — 1) 

Pdef = 1 ~ = 1 - exp + Of — I , 

rv \ In \n L / ) In 

and the edges are independent for e G £. Since m ~ cn, we have 

Pr(no defective edge) ~ exp ( - r ( r ~ l ) m \ „ e -cr(r-l)/2 < (1) 

V 2n / 

Let mdef be the total number of defective edges. Then m^ e f has distribution Bin(m,pdef)> an d 
E[m,dcf] = mpdei ~ cr(r — l)/2, as n — > oo. Hence Chernoff's bound [11, Corollary 2.4] gives, for 
large enough n, 

Pr(m de f > Inn) < e" lnn = 1/n. (2) 
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Hence G G Q*(n,r,cn) has at most Inn defective edges a.a.s. 

Conditional on there being no defective edge, £ is a uniform sample of size m chosen, with replace- 
ment, from the N possible r-subsets of [n]. Thus 

N (N - 1) • • • (N - m + 1) 



Pr (no duplicate edge | no defective edge) 



N 



,.2 



, m(m - 1)\ e c if r = 2, 

ex P ^7 ~ \ , ., ( 3 ) 



2iV ; l if r > 2. 



Thus, if r > 2, there are no duplicate edges a.a.s. If r = 2, let md up be the total number of duplicate 
edges. Then E[md up ] < m(m — 1)/N < 3c 2 , for large n, since each edge e G £ has (m — l)/iV 
duplicates in expectation. Thus, using Markov's inequality [TTJ (1.3)], 

3c 2 

Pr(m dup >lnn) < — = o(l), (4) 
Inn 

and hence mdup < Inn a.a.s. With more work, it is possible to obtain a smaller error probability, 
similar to that in ([2]), but (J4]) will suffice for our purposes. 

Let us call an edge bad if it is a defective or duplicate edge. Conditional on there being no bad 
edges, G*(n,r,cn) is identical to Q(n,r,cn). Combining ([1]) and © gives, for G G Q*(n,r,cn), 

{ e -c(c+i) if r = 2 
e -cr(r-l)/2 ifr>2 ' (5) 

So G has no bad edges with probability fJ(l) as n — ^ 00 • Now ([S]) clearly implies that any event 
occurring a.a.s. in Q*(n,r,cn) occurs a.a.s. in Q(n,r,cn). 

Combining ([2]) and the number of bad edges m-bad = mdef + mdup < 21nn a.a.s. We will use 
this in Lemma fl~2l below to show that Q{n, r, cn) and G*(n, r, cn) are essentially equivalent, for our 
purposes. 

We also make use of the following simple property of Q*(n, r, m). A vertex i G [n] of G G f2*(n, r, m) 
is isolated if it appears in no edge. Then the following simply restates the determining property of 
an isolated vertex, that it is absent from v. 

Observation 1.1. If S C [n], let Is be the event that all vertices in S are isolated in G G G*(n, r, m). 
Let G' be G conditional on Is- Then G' G Q*{n — \S\,r,m), up to relabelling vertices in [n] \ S. 

We show, in the proof of Lemma l4.lt that G G G*(n,r,cn) has O(n) isolated vertices a.a.s. and 
hence G has many disconnected components. 

A further model of random hypergraphs is often used, which we will denote by Q(n,r,p). In this, 
the edge set £ of G is chosen by Bernoulli sampling. Each of the ./V possible r-subsets of [n] is 
included in £ independently with probability p. Essentially, this is Q(n, r, m) where m is a binomial 
random variable Bin(A r ,p). We show in Section 11.21 below that Q(n,r,cn/N) and Q(n,r,cn) are 
equivalent for our problem. 

1.2 Hypergraph colouring 

Let N denote the set of positive integers and define No = N U {0}. A function a : [n] — > [k] is 
called a /c-partition of [n], the blocks of the partition being the sets cr _1 (i), with sizes n, = |<7 -1 (i)| 
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(i £ [k]). Let 11^ denote the set of ^-partitions of [n], so |IIfc| = k n . A /c-partition is called balanced 
if [n/k\ < rii < \n/k\ for i = l,...,k. For convenience, we will assume that k divides n, so 
Ui = n/k for % = 1, . . . , k. Since we suppose k to be constant, the effects of this assumption are 
asymptotically negligible as n — > oo. Let denote the set of all balanced ^-partitions of [n], so 
|3*| =n\/((n/k)\) k . 

A /c-colouring of a hypergraph H = Qn],£) is a /c-partition a such that for each edge e £ £, the 
set a(e) satisfies |<r(e)| > 1. (We use the notation H for fixed hypergraphs and G for random 
hypergraphs.) We say an edge e 6 £ is monochromatic in <r if |<r(e)| = 1, so a /c-partition is a 
colouring if no edge is monochromatic. The chromatic number x{H) is the smallest k such that 
there exists a /c-colouring of H. A balanced /c-colouring of a H is a balanced /c-partition which is 
also a /c-colouring of H. Following [3], the analysis below will be carried out mainly in terms of 
balanced colourings, assuming that k divides n. These choices will be justified below. 

Note that what we study here is sometimes called the weak chromatic number of the hypergraph. 
The strong chromatic number is defined similarly in terms of strong colourings, which are k- 
partitions a such that |<r(e)| = |e| for each edge e £ £. Even more general notions of colouring may 
be defined. See, for example, [12] , We will not consider this further here, though it seems probable 
that the methods we use would be applicable. 

The principal objective of the paper will be to prove the following result. 

Theorem 1.1. Define u r ^ = k r ~ 1 lnk for integers r > 2 and k > 1. Suppose that r, k > 2 and 
G G Q(n, r, cn). 

(a) If c > u r ^ then a.a.s. x(G) > k. 

(b) If max{ r,k} > 3 then there exist constants c r ^ such that for any c < c r ^, a.a.s. x{G) < k. 

(c) If m&x{r, k} > 3 then the constants c r ^ from \(b)\ satisfy u T) k-i < Cr,k < u T) k- 

Now the following theorem, which is a complete generalisation of the result of [3] to uniform 
hypergraphs, follows easily. 

Theorem 1.2. For all r, k > 2, if u r ^-i < c < u r ^ then a.a.s. the chromatic number of G £ 
G(n, r, cn) is either k or k + 1. 

Proof. Let G £ Q(n,r,cn). First suppose that max{r, k} > 3. If u r ^_\ < c < c r ^ then, by 
Theorem 11.11 we have k — 1 < x{G) < k a.a.s.. So x{G) = k a.a.s. in this case. If c T) t < c < u r ^ 
then Ur^-i < c rj fc implies that x(G) > k — 1 a.a.s., and u ri fc < c rj fc + i implies that x{G) < k + 1 
a.a.s.. Thus we have x(G) £ {k,k + 1} a.a.s. in this case. Next suppose that k = r = 2 and let c 
satisfy = U2,i < c < U2,2 < £2,3. Since c > the random graph G has at least one edge and hence 
X (G) > 2. Also, c < c 2 ,3 implies that x(G) < 3 a.a.s., by Theorem O Thus x{G) £ {2,3} a.a.s., 
completing the proof. □ 



Part (a) of Theorem 11.11 is easy, and is proved in Lemma 12.11 with the constants u r ^ = k r 1 ln/c. 



As in [3], parts [(b)] and (c) will be proved using the second moment method p. 54]. If Z is a 

od applies the inequalities 

< Pr(Z > 0) < E[Z] . (6) 



random variable defined on No, this method applies the inequalities 

E[Z] 2 



E[Z 2 ] 

Although based on a rather simple idea, the second moment method is often very laborious to 
apply, and our analysis will be no exception. 
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The random variable Z in (|6|) will be the number of balanced /c-colourings, so Z > implies the 
existence of a fc-colouring. However, the analysis in Section [2] will only allow us to conclude that 
c < c r> fc implies that liminf„_ s . 00 Pr(Z > 0) > 0. Thus, what we will actually prove is a weaker 
statement: 

(H ) Ifr, k > 2, there exist constants c T) ^ such that c < c r ^ implies liminf n _*. 0O Pr(x{G) < k) > 0, 
where G G G(n,r, en). 

Then part |(b)| of Theorem 11.11 will follow from the fact that there is a sharp threshold for k- 
colourability of a random hypergraph. Achlioptas and Naor [1] used a result of Achlioptas and 
Friedgut [lj which established that random graph fc-colourability has a sharp threshold. We will 
use instead the following, more general, result. 

Hatami and Molloy [10] studied the problem of the existence of a homomorphism from a random 
hypergraph G G Q(n,r,m) to a fixed hypergraph H. Given H = ([v},£h) G r, /i), they 

considered the threshold m(n) for the existence of a homomorphism from G = ([n], £q) G G(n, r, m) 
to H. A homomorphism from G to H is a function a : [n] — > [u] such that, for all e G <?Gj c(e) G 
If ii 7 is formed from by deleting duplicate edges then the homomorphisms from G to H' are 
identical to those from G to H, so we may assume that H has no duplicate edges. A loop in 
H is an edge e G <?# for which the underlying set is a singleton. A triangle in H is a sequence 
(ui, ei, f2, e2, t>3, 63) of distinct vertices «j G [v] and edges ej G £h (i G [3]), such that v\, v-i G ej, 
f 2 , G e2 and ui, U3 G 63. The following was proved in |10| (with minor changes of notation) : 

Theorem 1.3 (Hatami and Molloy). Let H be a connected undirected loopless r -uniform hypergraph 
with at least one edge. Then the H -homomorphism problem has a sharp threshold iff (a) r > 3 or 
(b) r = 2 and H contains a triangle. 

In this setting, a sharp threshold means that there exists a function c(n) = 0(1) such that, for all 
< e < 1, G G G(n, r, (1 — e)cn) has a homomorphism to H a.a.s., and G G Gin, r, (1 + e)cn) has 
no homomorphism to H a.a.s. 

Observation 1.2. The property of having an ff-homomorphism is a monotone decreasing property of 
G, that is, an //-homomorphism cannot be destroyed by deleting arbitrary edges of G. Monotonicity 
is a necessary, but not sufficient, condition for a property to have a sharp threshold. See [HI p. 12] 
for further information. 

Lemma 1.1. Suppose that k,r > 2 with max{A;,r} > 3. Then the problem of k- colouring G G 
G{n,r,m) has a sharp threshold. 

Proof. Take K = G £l*(k,r,[i) to be such that £k contains all r-multisets with elements in 

[k], except for the k possible loops. Then fi = ( fc+ ^ _1 ) — k. It is easy to see that the homomorphisms 
from G G G(n, r, m) to K are precisely the fc-colourings of G. If r = 2 and k > 3 then K contains a 
triangle. (We may take Vi = i and ej to be an edge with underlying set [3] \ {i}, for i G [3].) Thus 
it follows from Theorem 11.31 that the problem of /c-colouring G G G(n,r,m) has a sharp threshold 
unless k = r = 2. □ 



In the excluded case, which is the question of whether a random graph is 2-colourable, it is known 
that there is no sharp threshold [7]. 



We use Lemma [1.11 as follows. Suppose that max{r, k} > 3. From part (b') of Theorem 1 1.1[ we will 
have c rt k such that G G G(n,r,cn) and c < c r ^ implies liminf n _ s . 00 Pr(x(G) < k) > 0. Lemma [TTT1 
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then implies that the threshold function c(n) satisfies c(n) > c r fe. Thus for any c < c T ^ we have 
a.a.s. x{G) < fc, proving part |(b)| of Theorem ll.il 



In fact, we will prove an even weaker statement than (t/ 



' ) If r, k > 2, there exist constants c r ^ such that c < c r ^ implies liminfj-^oo Pr(x(G) < k) > 0, 
where G G Q*(kt,r,ckt). 



Observe that, in addition to restricting n to multiples of k, the random hypergraph model for (b' 



is different from that used in (b') We will now show that 



Lemma 1.2. (b" ) implies (V ' ) 



Proof. Let P*(n, m) = Pi(x(G) < k), where G G G*(n, r, m), and let (5(c) = liminft-^oo P*(kt, ckt). 
Then (b") is the statement that 5(c) > for all c < c T j~. 



Given n and c < c r ^, let t = \n/k\ and let d be such that c < d < c T ^. We show in Lemma 14.11 
that G G G*(n,r,cn) has at least (k — 1) isolated vertices a.a.s. Delete n — kt < k — 1 isolated 
vertices from G. This gives G' G Q*(kt,r,cn), by Observation I l.li So P*(n, cn) = P*(kt, cn) — o(l). 

Next, if n > dk/(d — c) then c'/ct > c'(n — k) > cn. Therefore, since fc-colourability is a monotone 
decreasing property (Observation [L~2j) , it follows that P*(kt, cn) > P*(kt,dkt). 



Finally, since d < c r ^, (b") implies that V*(kt,dkt) > 6(d) — o(l), with 5(d) > 0. Hence we have 

P*(n,cn) > P*(kt,cn) -o(l) > P*(kt,dkt) - o(l) > 5(d) - o(l), 
which implies lim inffj^oo P* (n, cn) > 5(d) > 0. 

Now let P(n, m) = Pr(x(G) < k), where G G G(n, r, m). Let G G £7(n, r, cn), where c < c r! fc. It was 
shown in Section fLTI that G can be obtained from G' G Q* (n, r, cn+b) by deleting b bad edges, where 
b = mbad < 2 In n a.a.s. Note that b is a random variable, and the bad edges are not a random subset 
of S, but this will not matter. Now G' G Q*(n,r,cn + b) can be obtained from G" G Q*(n,r,dn) 
by deleting (c' — c)n — 6 random edges, where c < d < c r ^. Note that (d — c)n > b a.a.s. Since 
d < Cj.)., we have P*(n,dn) > 5(c") > 0, for any c" such that d < c" < c r ^. Now, by monotonicity 
(Observation [L2]) and Lemma ll.l| P*(n,dn) > 5(c") implies that P*(n, cn + b) > 5(c") — o(l), 



which in turn implies that P(n, cn) > 5(c") — o(l). Thus liminfn^oo P(n, cn) > 0, giving (b') □ 



The remainder of the paper will be devoted to proving Theorem ll.lt with part | (b ) | weakened to (b" 



First we obtain expressions for E[Z] and E[Z 2 ] in Sections 12. II and 12. 2\ respectively. The expression 
for E[Z 2 ] is analysed using Laplace's method, under the assumption that part |(c)| of Theorem 11.11 
holds. This requirement is established in Section [3l completing the proof. Some remarks about 
asymptotics are made in Section [3.61 

The analysis of Section [3] will require many technical lemmas, some merely verifying inequalities. 
These inequalities are obvious for large r and k but, since r and k are constants, we need to establish 
precise conditions under which they are true. We relegate the proofs of most technical lemmas to the 
appendix, since they complicate what are fairly natural and straightforward arguments. Therefore, 
whenever we use a lemma without proof, the proof can be found in the appendix. 

To complete this section, we prove the result corresponding to Theorem 1 1.21 for the Bernoulli random 
hypergraph model Q(n,r,p). Recall that u r ^ = k r ~ 1 lnk and N = ("J. 

Corollary 1.1. Let r, k > 2. For c G (0, oo), let k(c, r) be the smallest integer k such that c < u r ^. 
IfGe Q(n, r, cn/N) then x(G) G {k(c, r), k(c, r) + 1} a.a.s. 
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Proof. Let G G Q(n,r,cn/N), and let m be its (random) number of edges. Then Chernoff's 
bound [HJ Corollary 2.3] gives 

Pr (\m - cn| > m 3/4 ) < 2e" cv ^ /3 . 

Therefore cn(l — re -1 / 4 ) < m < cn(l + ra -1 / 4 ) a.a.s., and hence c'n <m < c"n a.a.s. for any positive 
constants c 7 , c" such that d < c < c" . 

Let k = k(r,c), so u r ^_i < c< u r ^. Choose d G (-u^-i, c), so m > c'n a.a.s. Now, conditional on 
m > dn, d > u r ^-i implies x{G) > k a.a.s., by Theorem 11.21 and Observation 11.21 

Similarly, choose c" G (c, c r ^+\)-, so m < d'n a.a.s. Then, conditional on m < c"n, c" < c r ^+\ 
implies x(G) < k+1 a.a.s., by Theorem II .21 and Observation [L2j Thus x{G) G {k, k + 1} a.a.s. □ 

Remark 1.1. We have shown the equivalence of various models for our problem when max{k, r} > 3. 
We note that this equivalence does not hold for the case k = r = 2, where the non-existence of a 
2-colouring is equivalent to the appearance of an odd cycle in a random graph. This is due to the 
absence of a sharp threshold for this appearance. See [6j for further information. Fortunately, this 
has little impact on our results. 



2 Moment calculations 
2.1 First moment 

Lemma 2.1. Let r, k > 2 and recall that u Tt k = k r ~ l \nk. Suppose that c > u Tt k- Let G G 
G*(n,r,cn). Then a.a.s. x(G) > k. 

Proof. Consider any /c-partition a G with block sizes rij {i G [k]). Given a, a random edge e G £ 
is monochromatic with probability Y^i=i( n i/ n Y — k(l/k) r = l/k r ~ 1 , using Jensen's inequality [9] 
with the convex function x r . Since the edges in £ are chosen independently, the probability that a 
is a /c-colouring of G is at most (1 — l/k r ~ 1 ) cn . Let X be the number of fc-colourings of G. Using 
dSJ) and the fact that |H fe | = k n , we conclude that Pr(X > 0) < B[X] < (k (1 - l/Af- 1 ) )™ If 
c > u r ^ then 

A ; (l-^ I ) C = exp(lnA ; + C ln(l- F=T )) < exp (in k - ^zr^ ) < 1, 

if c > (k r ~ 1 — 1 /2)lnk, where we have used Lemma 14.51 in the penultimate inequality. It follows 
that Pr(X > 0) — > as n — > oo when c > (k r ~ l - 1/2) In A;. Clearly (k^ 1 - l /2) In k < u r ^ k . □ 

Remark 2.1. We have proved the slightly stronger bound (A; r ~ 1 — 1 /2) Ink. This is used in [3], and 
noted, but not used, in [3]. Since the difference is small, we mainly use the simpler bound /j r_1 lnfe. 
Note also that u T) \ = 0, in agreement with our definition above. 

In the remainder of the paper, we will assume that k divides n, unless stated otherwise. Thus, 
if k divides n, let Z be the number of balanced colourings of G G G*(n,r,cn). For any balanced 
partition a G and any e C [n], let M e (a) be the event that | cr(e) | = 1. If e is an edge of 
G G Q*(n,r,cn) then clearly Pr(M e (cr)) = 1 — l//c r ~ 1 , and these events are independent for e G £. 
Thus, since = n\j ((n//c)!) fc , 

n\ { 1 \ cn k k/2 ( ( 1 \c\n 
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2.2 Second moment 



Using the notation of Section \2.1\ let a, r G be balanced partitions. Then M e {a) n M e {r) is the 
event that the edge e is not monochromatic in either a or r. For i,j G [A;], define 

£ij = \{v G [n] : a(v) = i, t(v) = 
Let L be the k x k matrix (lij). Then L G T>, where 

V = {L G N fcxfc : Eti^ = E;=i*« = n/k}. 

There are exactly n!/( riij=i ^<iO P a hs cr, r G which share the same matrix L G T>. 
Now Pr(M e (a)) = Pr(M e (r)) = l/fc r ~ 1 , and 

Pr(M e (a)nM e (r)) = ^^(^. 

i=i i=i 

Thus by inclusion-exclusion, 



Pr(M e (<x) n M e (r)) = 1 - Pr(M e (a)) - Pr(M e (r)) + Pr(Af e (a) n M e (r)) (8) 

i=l j=l 



Therefore 



cr,reS fc l,J=l 

\ cn 

n! / 2 A /&,-\ r 



Lex> 1 lij'=l tl J • \ » J=l 

For X = (xjj) £ R hfc , define the functions 

F(X) = -^^^Tn^ + cln(l--^- + ^^4), (10) 

i=i j=i 

G(X)= (2vrn)" (fc " 1)/2 (n^i^)" 1/2 - 



If all the lij — > oo then applying Stirling's inequality to the summand of (|9|) gives 



EL 1=1 A?' 



2 / lij \ r 

1 ~ kF 1 + ^ W 



. n / \ u J \ k ' — ' v n 



8 



= G{L/n)e nF ^ L ' n \ (11) 

On the other hand, if some of the are bounded then the summand of ([9]) equals 0(1) times the 
final expression in (lllh . again using Stirling's inequality. 

Let Jo be the k x k matrix with all entries equal to 1/k 2 . Then 

F(J ) = 2lnk + 2cln (l - l/k r ~ l ) = In (k(l - ^i)")' , (12) 
G(J ) = (2^n)"( fe2 - 1 )/ 2 k k2 . (13) 
Hence the term of © corresponding to L = n«7o is asymptotically equal to 



(27rn) (^-D/n v F-i 

Observe from ([7]) that this term is smaller than E[Z] 2 by a factor which is polynomial in n. We 
will find a constant c r ^ such that when c < c r fc, the function F(X) has a unique maximum at 
X = Jq. This will allow us to apply the following theorem of Greenhill, Janson and Rucihski [8] 
to estimate E[Z 2 ] in the region where c < c r ^. (See that paper for background and definitions.) 

Theorem 2.1 (Greenhill et al. [8]). Suppose the following: 

(i) C C R N is a lattice with rank r. 

(ii) V C R N is the r- dimensional subspace spanned by C. 

(Hi) W = V + w is an affine subspace parallel to V, for some w G R N . 
(raj K C R N is a compact convex set with non-empty interior K°. 

(v) (J) : K — > R is a continuous function and the restriction o/cf) to Kfl W /ias a unique maximum 
at some point xq G K° n W. 

(mj c() is £u>ice continuously differ entiable in a neighbourhood of xq and H := D 2 cf)(xo) is its 
Hessian at Xq. 

(vii) 4> : Ki —7- R is a continuous function on some neighbourhood Ki C K o/xo wit/i ^(xo) > 0. 
fmii) For eac/i positive integer n £/zere is a vector l n G R N luit/i ^n/ 1 " 1 - £ W. 
^ixj For each positive integer n, there is a positive real number b n , and a function 
a n : (C + t n ) n nK — > R suc/i that, as n — > oo, 

a n (£) = 0(b n e n * (£/n)+o(n) ), £ G (£ + £ n ) n nK, 

and a n («) = b n (r|j(£/n) + o(l))e n * (e/n) , £ G (£ + £ n ) n nKi, 

uniformly for I in the indicated sets. 
Then provided det(— H|v) 7^ 0, as n — > 00, 

V/2, 



□ 



£e(£+«Ti)nn.K 
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the asymptotic approximation given by this theorem remains valid for n € I, 

hold for all n 6 I. The conclusion of the theorem 



ix. 



As remarked in 

where I C N is infinite, provided (viii) and 
then holds for n 6 I as n — > oo. We will use this observation with I = {kt : t E N}, since we require 
only the weaker statement (b") in Theorem ll.il 

We must relate the quantities in Theorem l2.1l to our notation and analysis. We let n be n, restricted 
to positive integers divisible by k. Denote by M fcxfe the set of real k x k matrices, which we will 
view as fe 2 -vectors in the space M. k . Then N = k 2 in Theorem 12. 11 Next, V in Theorem 1 2 . 1 1 will be 
the subspace M of M. kxk containing all matrices X such that all row and column sums are zero, 
i.e. 

ELi x ij = ELl x ii = U e M) i 

and the affine subspace W will consist of the matrices X such that all row and column sums are 
1/k, i.e. 



k 

i=l 



x 



lA (je[fc]). 



The point w E W will be Jo- 

The lattice £ in Theorem 12.11 will be the set of integer matrices in A4: that is, the set of all k x k 
integer matrices L = such that 



i=l — 2_/»=i 







Let ^ n equal the diagonal matrix with all diagonal entries equal to n/k. Then l n S W and £ n is an 
integer matrix, since we assume that n is divisible by k. 

The compact convex set K will be the subset of R fcxfc such that < x^j < 1 /k (i,j G [k]), which has 
non-empty interior Kq = {x^ : < < 1 /k}. Define a n (L) to be the summand of ©; that is, 



n 



i,j=l ^3- 



We wish to calculate E[Z 2 ], which by ([9]) equals 



Le(C+£ n )DnK 



(14) 



In Section [3] we will prove the following result. 

A; r ~ 1 lnA; for k,r > 2. iVoro fix r,k > 2. There exists a positive 



Lemma 2.2. Recall that u r k 
constant c T ^ which satisfies 



u T) k-i < c T) k < u T) k and c ri k < 



(k 



r-l 



1) 



r(r — 1) ' 

such that F has a unique maximum in K n W at the point Jq £ Kq nW whenever c < c r ^k- 



Throughout this section we assume that Lemma 12.21 holds. Then J$ is the unique maximum of F 
within K n W, so we set cj) := F and xo := Jo- Note that F is analytic in a neighbourhood of Jo- 
Let Ki be any neighbourhood of Jo whose closure is contained within Ko- The function il> m 
Theorem 12.11 will be defined by i\>(X) = nfj=i x 7j- ^° ^ ^ s positive and analytic on Ki. We 



10 



let b n equal (27m) ^ k X ^ 2 . The quality of approximations required by (ix) of Theorem 12.11 follow 
from (fTUj) and the paragraph below it. 

Next, observe that C and M. both have dimension (k — l) 2 , since we may specify iij or X{j (i,j E 
[k — 1]) arbitrarily, and then all iij or Xij G [k]) are determined. Thus r = (k — l) 2 in 
Theorem 12.11 

We now calculate the determinants required in Theorem 12.11 Let H be the Hessian of F at the 
point Jo- This matrix can be regarded as a quadratic form on R fexfe . In Theorem 12.11 we need 
the determinant of — H\mi which denotes the quadratic form —H restricted to the subspace Ai of 
M fexfc . This can be calculated by 

, , TT . , det U T {-H)U 

■hK-BUd- det { uTu > (15) 

for any k 2 x (k — l) 2 matrix U whose columns form a basis of Ai. 

Lemma 2.3. Suppose that k,r > 2 and c < c r ^, where c r ^ satisfies Lemma \2.2l Then the 
determinant of £ is detC = k k ~ l and the determinant of —H\m is (k 2 a.)^^ 1 ^ 2 , where 

cr(r — 1) 

a = 1 



(jfer-l_l)2- 



Proof. Let Sij be the Kronecker delta, and define the matrices Eij by (Eij}^., = 5u>5jji. Then 
{Eij : i,j G [A;]} forms a basis for M fcxfc . Let E^ be such that (E^.,., = dw, and E*j be such 
that [E^j^,., = Sjji. Then Ai is the subspace of IR fcxfc which is orthogonal to {E^, E^- L : i 6 [k]}. 

We claim that the vectors Uij = E^ — E^ — E^j + E^h (hj £ [k — 1]) form a basis for Ai. To 
show this, consider elements of M. kxk as vectors in IR fc2 (under the lexicographical ordering of the 
indices (i, j), say). Then for i',i,j 6 [k — 1], taking dot products in M fc gives 

k k 

Ei>* • Uij = ^2 y){Ei'*)tl' (Uij)u' 

k k 

= ^2 ^2 bi'^it^ji' - 5ii5ke> - 5ki5j£/ + 5ki5ke>) = 8a> - 5n> -0 + = 0, (16) 
i=\ t>=\ 



k k 

y] <W (5i£5je - 5ii5k£i - bktbji 1 + 6kt$ki') = fa'j - - Si'j +0 = 0. (17) 

(18) 

Thus, from (|16p and (|17p . the (k — l) 2 vectors E/y lie in Ai, so we need only show that they are 
linearly independent. We will do this computing the determinant of the corresponding (k — l) 2 x 
(k — l) 2 Gram matrix M. Let U be the k 2 x (k — l) 2 matrix with columns Uij £ [k — 1]). 
Then M = {miji'j') = U T U, and we calculate (taking dot products in M fc2 ), 

— Uij ' U i> ji 



and, similarly, 

E*i' ■ Uij — 
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k k 



i=\ t'=\ 

5ui5jj> + <5jj/ + Sjf + 1. 



M 



l)x 


(k- 


1) block matrix, with blocks of size (k 


-1 


2B 


B 


... b~ 




"2 1 


1" 


B 


2B 


... b 




1 2 ••• 


1 








, where B = 






B 


B 


'-. B 




11 '■■ 


1 


B 


B 


■■■ 2B 




1 1 ••• 


2 



We compute the determinant of matrices of this form in Lemma 14,21 Taking p = q = k — lin 
Lemma \4.2\ we have detM = k^ 1 ^^ 1 = k 2 ^ k ~ l \ In particular, since the determinant is nonzero, 
it follows that the G [k — 1]) give a basis for Ai. Also, note that 



U 



h- 



U' 



where Ik-i is the (k — l) 2 x (k — l) 2 identity matrix, and J7' is a (2/c — 1) x (k — l) 2 integer matrix with 
entries in { — 1,0, 1}. It follows that {U ij ■ i, j G [k — 1]} is a basis for the lattice C, since X = UY 
if and only if = Xjj (i, j G [A; — 1]). Therefore the determinant of £ is det£ = Vdet M = k k ^ 1 . 

We also require the determinant of — H\m. For X G M, let 



fe fc 

F 1 (X) = -^^x lJ ln: 

i=l i=l 



A: k 



'J • 



i=l i=l 



Then = {hij^jn has entries 



a 2 F x 



dxijdxi'ji 



+ c 



Jo 



^ lnF 2 

dxijdxi'ji 



Now 



Next, 



Hence 



r a 2 F 1 - 




1 " 


Qx ^ j Qx tfji 


Jo 





dlnF 2 1 dF 2 



dx 



Jo 



and 



-k Su'Sjji. 



'./ 



d 2 lnF 2 

dxijdxi'ji 



F 2 dxij 

1 9 2 F 2 
F 2 oxijdxi'f 



r(r - l)a£. 2 



F 2 
r(r — 1) 



dF 2 r-1 
^ — = rx u • 

OXij 



1 <9F 2 <9F 2 



_2 r-l r-l 



F 2 

r 2 



Jo 



A ,2(r-2)( 1 _ 1 / A; r-l)2 li M fc4(r-l)(l _ 1 /fcr-l)4 
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k 2 r(r — 1) r 2 



(jfcr-l _ 1)2 (fcr-l_l)4- 

Here we have used the fact that ^(Jo) = (1 — l/k r ^ 1 ) 2 . 
These calculations show that 

-H = k 2 otI k + |3J 

where I k is the k 2 x fc 2 identity matrix, J is the A; 2 x k 2 matrix with all entries equal to 1, 

cr(r — 1) „ cr 2 



(^-1-1)2' (fcr-l_ 1 )4- 

By (fT5|) . the determinant of — equals 
detU T (-H)U detU T (k 2 aI k + $J)U det{k 2 ot)U T U {k 2 oc)^ k -^ 2 det U T U 2 (fc _ 1)3 



detU T U det U T U det U T U det U T U 



Here we have used the fact that JU = 0, which follows since every column of U is an element of 
M. and hence has zero sum. This completes the proof. □ 

Note that \|>(Jo) = k k ' 2 , while fTJ gives ct.(J ) = F(J ) = 2 In (Jfe(l - l/F" 1 ) ) . Now a is positive, 
which follows from Lemma 12.21 Hence Lemma 12.31 guarantees that det(— H\m) 0- Therefore we 
can apply Theorem 12.11 to @, giving 

k k / / 1 \ c\ 2n 



(2 7 rn) fc - 1 a( fc - 1 ) 2 /2 V V fc^-l 
Thus, from ([7]), for all r, k > 2 we have 



which is a positive constant. So limkuv^oo Pr(Z > 0) > and we have established part (b") of 
Theorem II. H 

It remains to prove Lemma 12.21 which is the focus of the next section. 

3 Optimisation 

We now consider maximising the function F in (|10p . and develop conditions under which this 
function has a unique maximum at Jo- I n doing so, we will determine suitable constants c r)k and 
prove that Lemma 12.21 holds. This will complete the proof of Theorem I l.li 

Our initial goal will be to reduce the maximisation of F to a univariate optimisation problem. 
This reduction involves several stages, which we do in Section 13.11 We analyse the univariate 
problem in Sections l3.2H3.51 Finally we consider a simplified asymptotic treatment of the univariate 
optimisation problem in Section 13.61 
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3.1 Reduction to univariate optimisation 

It will be convenient to rescale the variables, letting A = (a«) be the k x k matrix defined by 
A = kX, so ciij = kxij for all i,j E [k]. Substituting into ([TP]) , we can write 

1 k k ( 2 \ 

F(X) = In k - - E ai i ln °y + cln ( 1 " k^T + ) 

i=l j=l 

where 

k k 
i=l i=l 

Letting z = F(X) — Ink, we consider the optimisation problem 

k k 



maximise z 



1 / 2 p \ 

-^^flijlDOii+clll 1- (19) 

t=l i=l v y 

subject to EE a li = T^z? > ( 20 ) 

(i E [*]), (21) 

(J 6 [*]), (22) 

(i,i€[fc]). (23) 
In any feasible solution to this system we have 

p = r2 EE<i ^ ^ 2 £(£^) r = v- 1 ( 24 ) 



t=l i=l 




k k 




EE«^ 


fc' 


i=i i=i 


k 




£«*; 


= 1 


i=i 




fc 




E a ^' 


= 1 


i=i 






> o 



i=l j=l i=l j=l 



and 



* * r kr-^EllZU^) k-^ 

2^2^ a n ^ —r—u 1 — n 



where we have used Holder's inequality [9J in (|25p . If p ^ [1, fc r ], the system is infeasible, and we 
may take maxz = — oo. 

We wish to determine the structure of the maximising solutions in the optimisation problem (|19|) - 
(1231) . Following @], we relax J22J and (J23J), and write j5DJ) as 

E<i = ^f' E^ = ^- 

j'=i i=i 

By the same method as for (I24p and (|25h . we can show that 1/k < qi < k r ~ 2 in any feasible 
solution. Note that (|2"4"|) and (f25j) assume ctjj > 0, but the relaxation of ([23]) is unimportant, since 
z is only defined for all dij > 0. 
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Consider any fixed feasible values of the Qi (i G [k]) such that Y^j=i Qi = 9- Then the problem 
decomposes into k independent maximisation subproblems. We will use Lagrange multipliers to 
perform the optimisation on these subproblems. We temporarily suppress the subscript i, to write 
a = (oi, a,2, . . . , aj;) for the ith row of A. The subproblem is then 

k 

maximise z\ = — aj In aj (26) 

3=1 
k 



subject to 5Z°i = T^=2' ( 27 ) 



Q_ 

3=1 
k 



aj > 0. (29) 

We assume that 1/k < g < k r ~ 2 , so the problem is feasible. Note that the form of z\ implies that 
it is not maximised on the boundary of the feasible set, since dzi/daj = +oo when aj = 0. 

Introducing the multiplier A/r for (|27l) and fj, for (l28j) . the Lagrangian is 

maximise L(a,A, //) = — ^^aj In a^H — f a j ~ ) ^ ( flj ~ 7 ' 

3=1 3=1 ' i=i 

The maximisation gives (|27p and (|28|) . together with the equations 

<f(aj) = (j G [k]), where ip(x) = — 1 — lnx + \x r ~ l + \i. (31) 

If the equation <p(x) = has only one root then, from (|28p . aj = 1/k (j G [n]) and hence, from 
(|2T|) . £ = 1/fc. (Note that there is at least one root since the objective function of ([26]) - (f29]) is 
defined only on a bounded region, and we have assumed the problem is feasible.) 

Now suppose that the equation <p(x) = has more than one root, and let a be the largest. If a 
satisfies aj / a for some j G [k] then subtracting the corresponding equations in ([3T|) gives 



That is, 



In a - Inoj - A(a r " 1 - a^ 1 ) = 0. 



A= lna ~ ln ^ >0 , 



Hence, since — lnx and x r ~ 1 are both convex on x > and A is positive, p(x) is a strictly convex 
function. It follows that the equation <p(x) = has at most two roots in (0, 00). Let the roots of 
<p(x) = be a and /3, where we assume that a > f3. We have aj G {a,/3} for all j G [k]. But we 
still need to determine how many of the aj are a and how many are j3. 

Consider any stationary point a of L, and suppose without loss of generality that for some 1 < t < 
k — 1 we have ai, . . . ,a t = a, a t+ i, ...,% = /3. At a the Hessian ii" of the Lagrangian is a x A; 
diagonal matrix with 

hjj = <p\a) = -- + (r - l)Aa r - 2 (j = 1, . . . ,t), 
a 
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h js = ip'(P) = ~ + (r- l)XP r - 2 (j = t + 1, . . . , k), 
h jf = U,j'e[k], j^j'). 

Since (p is strictly convex with zeros j3 < a, we know that <p'(j3) < < f'(a)- The quadratic form 
determined by the Hessian at a is 

t k 

x T Hx = y/(a)J>f + <//(/?) 4 ( 32 ) 
j=i j=t+i 

To determine the nature of the stationary point a, we restrict the quadratic form to x lying in the 
tangent space at a. This means that x satisfies linear equations determined by the gradient vectors 
of the constraint functions at a. See, for example, |14| . In our case, these equations are 

cr^Xi + fT- 1 E Xj = o, 
j=l j=t+l 

t k 

J2 x 3+ E x j = °- 

j=l j=t+l 

These equations are linearly independent since a > (3. They can be solved for x±,Xk to give 

X% = - Yj)=2 X i' x k = ~ Sj=t+1 X j in (133) • Tnen 

x T Hx = if/(a)(j2x^ +^{a)Y, x ) + ^'^) E *f + ^(/3)( E *i) ■ ( 33 ) 

j=2 j=2 j=t+l j=t+l 

For a to be a maximum, the right hand side of ([33)) must be negative for all X2, X3, ■ • • , x^-i such 
that x ^ 0. Since </?'(a:) > 0, v?'(/3) < 0, this will be true if and only if t = 1, when the terms 
with coefficient in (|33p are absent. This local maximum is clearly unique up to the choice of 

j £ [k] such that a 3 - = a, and hence is global, since z\ is bounded on the region determined by ([27]) 
to (|29p . Thus there are A; global maxima, given by choosing p 6 [/c] and setting a p = a, aj = f3 
(j ^ W\->3 7^ p); where (a,/3) is the unique solution such that a > /3 to 

a r + (fc - l)/3 r = k 2 ~ r 0i , 
a + (k— l)/3 = 1. 

Note that the relaxation of the constraints (J23]) proves to be unimportant, since the optimised 
values of the £ {a, /3} are positive. Thus the optimisation (f26j) - ([29|) results in the system 

maximise z\ = —a In a — (k — l)/31n/3 
subject to a r + (k — l)j3 r = k 2 ~ r g, 
a + {k-l)(3 = 1, 
a > p. 

We have included a > f3 here, though it is not a constraint, simply a labelling of the solution. 
Similarly below. 
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The combined problem over all i G [k] can therefore be written as 



maximise z 2 = — (cti In oti+{k — l)/3j In (34) 

fc i=l 

subject to ^ [a r i + (fc - l)/3[) = k 2 ~ r p, (35) 
i=l 

a, + (k - = 1 (i € [&]), (36) 
a* > A- 

We will again apply the Lagrangian method, using the multiplier A/r for (|35p . In order to avoid 
using k multipliers for ([55]) . we consider the variables to be (3 = (f3i,(3 2 , ■ ■■ and define the 
Lagrangian as 

k / k \ 

= " E («* lna ' + ( k - !)A ln&) + - ( £ K + ( fc " i)/ 3 !) " I • 

i=l r \i=l J 

The aj are then defined through ([36]) as Oj = 1 — (fc — 1)/%, and hence daj/d/?j = — (fc — 1). 
Thus the stationary points of the Lagrangian are given by (f35jh ([36]) and the equations 

~' <>l " Inoi-InA-AK- 1 -^- 1 ) = (i €[*;]). (37) 



k-ldpi 



For each j € [A;], a solution to (f35|) - ([37|) has either aj = /3j or a, > /3j. In the first case we have 
cti = [3i = 1/k. In the second case, cej,/3j must satisfy the equations 



A = J? ° _ ^ , a + (fc-l)j8 = 1, (38) 



and these equations are identical for all such i. Thus for each i, either = 1/A; or is a solution 
to (p8|) . Suppose that solves ([38]) for (i = 1, . . . , t) and = l/fc for (i = i + 1, . . . , k). 

Remark 3.1. In fact, it can be shown that (|38p has at most two distinct solutions. We do not use 
this fact below, so we will not prove it. 

Now the optimisation (j3"4"l) ~ (j36"j) results in the following system: 

t 

maximise z 2 = kink — (In k + 014 In 014 + {k — l)/3j In /%) (39) 

i=i 

t 

subject to k 2 - r (p-l) = ^{a\ + {k - - k l ~ r ), (40) 

i=i 

a t + {k- l)Pi = 1 (*€[*]). (41) 

Let us write 

f(/3) = lnk + alna+(k-l)(3lnf3, gtf) = a r + (k - l)f3 r - -L , v (f3) = ^ . (42) 
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Then (J39J) and (|40p may be rewritten as 

t 

maximise z 2 = kink — /(A) (43) 
subject to fc 2 ~ r (p - 1) = ^ <?(&) . (44) 



Suppose, without loss of generality, that 

Then multiplying ((HJ) by 7?(/3i) and adding to ([15]) gives 

t 

maximise z 2 = fclnfc — fc 2 ~ r (p - l)r?(/3i) - ^g(A)(??(A) ~ ^(ffi)) ■ 

i=l 

Since we can choose the (i £ [t]) to be any roots of (|38p . z 2 is clearly maximised by taking 
Pi = Pi (i £ [t]) provided that (jjj]) is satisfied. Thus, we need only use the solution /3i of (f38j) 
which minimises r/(f3). If there are alternate minima (with r/(f3j) = r]{(3\) for j £ [s], say) then the 
maximum value of z 2 is unchanged by using any of the solutions /3j (j £ [s]). However, since we 
use the maximum value of z%, and not the maximising solution, it suffices to use only /3\. Thus we 
will simply write j3 for /3\ from this point. 

The resulting system is now 

maximise Z2 = k In k— k 2 ~ r (p — l)r/(/3) (45) 
subject to k 2 ~ r (p - 1) = tg{(3) , (46) 

so (f46l) can be satisfied by taking t = k 2 ~ r (p — l)/g((3). Note that we should have t G No and t < k, 
but we can only increase the maximum value of Z2 by relaxing these constraints. 

Let J be the k x k matrix with all entries 1/k, and note that J/k = Jo- We wish to find conditions 
on c which guarantee that F{A/k) < F(J/k) for all A / J which satisfy (|20l) . (J2T|) , From the 
above, and (JT9J) , this will be true precisely when 

^ + cln f 1 - — r + rnf-s I < lnfe + 2cln ( 1 ' 



k \ k'~ l k 2r ~ 2 / \ fc r ~ * 

Substituting z 2 = fcln/c — tf((3) from (|43p . this can be rewritten as 



cin ( 1+ jm ? )<lm- <4T) 

Substituting (p - 1) = tg(j3)k r ~ 2 , from (jMJ), in ([47]) gives 
Let 

^) ^ ^M), and C(/3) = ^=^^). (49) 
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Then (|48p can be written as 



i=0 v 

Now the right side of (|50p is clearly minimised when $ is as small as possible. This is when i9 = 0, 
and then we have c < C(/3). Thus we can estimate c r ^ by minimising C(/3). Then the computation 
of c r> fc reduces to minimising the function 

n(0) lnfc + (1 - (fc — ln(l - (fc - l)/3) + (fc - W In p (0<3<1/k) (51) 

V[fj) - (l-^-lJ^ + Cfc-l^-l/fcr-l (U</^<l/fc). (51) 



Therefore we may take 

(fc 7 - 1 - l) 2 
fc r ~ 1 "/3 



c r , fc = V Lr _ 1 7 min V (P) . (52) 



Then, whenever c < c rj fc, we know that (|50p holds, and hence that J is the unique maximum of F 
over all doubly stochastic matrices. 

Remark 3.2. We have taken 1? = in (148 j) . when the smallest possible value is clearly larger. 
Observe that we cannot have t G {0, 1}, or else (|2ip and (|22p imply that A = J. Thus we can 
assume that t > 2, and then (1381) becomes 



c < 2 m m 

- kln(l + 2k r ~ 2 g(P)/(k r - 1 - l) 2 )' V ' 

We could use (j53|) directly to improve the estimate of c r ^. This is done in [3] for fc = 2, giving a 
small improvement in c rj 2, though [4] uses only f)52|) for r = 2. We will use (j52p . but we show in 
Section 13.61 that the increment in c r ^ resulting from (|53|) can be obtained indirectly, by using (|5Up . 

Remark 3.3. We might improve the estimate of c r j~ further by avoiding the relaxation of (|22p in the 
optimisation. We note that taking t = fc in (148p results in a local maximum of (119p -( l23j) . as follows. 
Let p be any permutation of [fc], and set a ip ^ = a, a^- = /3 (J ^ p(i), i G [fc])- This gives fc! local 
maxima of f)19j) — f)23j) . We conjecture that these solutions are the global maxima, but we are unable 
to prove this. The inclusion of (|22h gives conditions for local maxima which have solutions that 
might yield larger values of z in (|19p . These solutions seem rather difficult to describe explicitly, 
so we leave this as an open question. However, we show in Section 13.61 that including (]22p cannot 
result in a large improvement in c r & . 



3.2 Minimising r) 

We have now achieved the objective of reducing the problem to a univariate optimisation, namely, 
minimising the function 77. To carry out this minimisation, we first make a substitution x = (fc — 
in (|5T1) . so that 

lnfc — xln(k — 1) + (1 - x)ln(l - x) +xlnx f(x) 

= r, ^ —rk 7T—r — „ 77 1 = -^7-7 (x G 0, 1 - 1/fc ), (54) 

n ' (1 - x) r + x r /(k - ly- 1 - l tf- 1 g(x) v L ' iJ K J 



where, as defined in ([4"2]) . 

f(x) = In fc — x ln(fc — 1) + (1 — x) ln(l — x) + x hix, 
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g(x) = (1 - x) r + x r /{k - l) 1 - 1 - l/k 1 " 1 . 

Now 

f{x)= ln(/c(l-x)) - a; In ((Jfe - 1)(1 - x)/x) 

for x G (0, 1 — i/fc), and at the boundaries we have /(0) = In A; and /(l — i/fc) = 0. Differentiating 
gives 

f'{x) = - ln(Jfe - 1) - ln(l - x) + hix (55) 
= -ln((fc-l)(l-x)/x) <0 for x G (0,1-l/fc). 

Therefore /(x) > /(l- V*) = f °r all x G (0, 1-i/fc). Also lirn^o /'(a?) = -oo while /'(1-1/k) = 0. 
Note, using (f55l) . that 



/(x) = In (jfe(l - x)) + xf'(x). (56) 

Also, 

/"(x) = — ?— + - = 1 > for x G (0, 1 - 1/fc], (57) 
1 — x x x(l — x) 

'"W-IR'-? (58> 

We note that f"(l - i/*) = k 2 /(k - 1) and f"(l - i/fc) = fc 3 (/c - 2)/(Jfc - l) 2 . Now we turn our 
attention to the function g, which satisfies g(0) = 1 — l/k r and g(l — 1 /k) = 0. Differentiating 
gives 

g '( x ) = - r ((l - x) r ~ l - x T - l j(k - l)^ 1 ) < for x G (0, 1 - i/fc), 

which shows that #(x) > gr(l - J/fc) = for x G (0, 1 - i/fc). Also g'(0) = -r and g'(l - 1 /k) = 0. 
Finally, 

9 "{x) = r{r -!){{!- xf- 2 + x r - 2 /{k -If- 1 ) >0 for x G (0, 1 - i/k], (59) 
/'(a;) = -r(r - l)(r - 2)((1 - x) r " 3 - x r ~ 3 /(A: - l) r-1 )- 

Note that, when r = 2, g" is constant and g'" is identically zero. Also, in particular, 

»h i/u\ r(r - 1) i/^ r(r-l)(r-2)(fe-2) 

Hence /(x) and g(x) are positive, strictly decreasing and strictly convex functions on (0, 1 — 1 /k). 
Returning to the function 77 defined in (I75p . in Lemma 14.81 we show that 

f~r-l ffc _ 2)fcr 

lim n(x) = — -, lim rf(x) = — oo, lim rfix) = — — — > 0, (60) 

s-KL-i/fe ' r(r — 1) x^o x-^l-i/k r(k — 1) 

and we will take these limits as defining 77(1 — l /k), r/(0) and r/(l — l /k) } respectively. Note also 
that 77(0) = jfe r - 1 lnlfe/(A: r - 1 - 1). 

If k = 2 then 77 has a stationary point at x = 1 — l /k = 1/2. Otherwise, 77 has an interior minimum 
in (0, 1 — i/fc), since 77' (0) < and 77' (1 — 1 /k) > 0. We first show that this is the unique stationary 
point of 77 in (0, 1 — l /k). This is not straightforward, since 77 is not convex, as observed in [4] for 
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the case r = 2. And, unfortunately, the approach of [3J, making a nonlinear substitution in 77, does 
not generalise beyond r = 2. Hence our arguments here are very different from those in [3]. 

To determine the nature of the stationary points of 77, we consider the function h{x) = f(x) — \g(x) 
on (0,1 — 1 /k], for fixed A > 0. Then h is analytic, and its zeros contain the points at which 
r](x) = A in (0, 1 — i/fc]. We will apply Rolle's Theorem [15J to h. The zeros of h are separated 
by zeros of h', and these are separated by zeros of h". Since /(l — 1 /k) = g{l — 1 /k) = and 
/'(l — l/fc) = g'(l — l /k) = 0, we conclude that b! has a zero at x = 1 — l /k for all A, and h has 
a double zero at x = 1 — i/fc. Now, from (|57p and (|59p. the zeros of h"{x) = f"{x) — Xg"(x) in 
(0, 1 — l /k] are the solutions of 

In Lemma [4.9l we show that if r < 2/c then (|6ip has at most two solutions in [0, 1], while if r > 2k + 1 
then (foTj) has at most two solutions in [0, 1 — 1 /k] whenever A < Ao, where 

1 , N /(r-2)2 r - 1 1 
— = rr-1 ^ i + — 



A V rT kr 

(Here, as elsewhere in the paper, we have r,k > 2.) For uniformity, we set Ao = oo if r < 2k and 
define 

A = {x G [0, 1 - i/fc] : jy(a:) < A } , A = A n (0, 1 - i/k). 

Then A' is a union of open intervals. We show in Lemma 14 . 1 1 1 that 7/(0) < r/(l — 1/fe) < Ao, which 
implies that 0, 1 — i/fc G A. Hence A = A' U {0, 1 — 1 /k\, which shows that A is nonempty. Now 
r/(0) < 0, 77' (1 — 1 /k) > imply that A' is nonempty. Our search for a value of x making 77 small will 
be restricted to A'. We have shown that h" has at most two zeros in A, and hence h has at most 
four zeros in A. Since there is a double zero of h at x = 1 — i/fc G A \ A', it follows that there are 
at most two zeros of h in A'. Thus, 77(21) = A at most twice in A'. Since 77' (0) < 0, 77' (1 — 1 /k) > 0, 
we know that 77 has a local minimum in A'. Then 77 has at most one local minimum £ € A'. 
To see this, suppose there are two local minima £1, £2 G A' with 7/(£i) < ^(£2) = A < Ao- If 
t/(£i) = A then rj(x) = A has at least four roots in A', with double roots at both £1 and £2. If 
t/(£i) < A then rj(x) = A has at least three roots in A', with a double root at £2 and, by continuity, 
a root strictly between £1 and £2. In either case, we have a contradiction. It also follows that A is 
connected. Otherwise, since r/'(0) < 0, 77' (1 — 1 /k) > 0, each maximal interval of A' must contain a 
local minimum, a contradiction. Thus A = [0, 1 — l /k\. 

We have proved that 77 has at most one local minimum in (0, 1 — 1 /k), and we will denote this 
minimum point by £ G (0, 1 — 1 /k). It also follows that there are no local maxima of 77 in [0, 1 — 1 /k\, 
as we now prove. If there were a local maximum £' G [0, £) then T7'(0) < would imply that there is 
a local minimum in (0, £'), a contradiction. The same argument applies to the interval (£, 1 — i/fe], 
for k > 2. If k = 2, it is possible that x = l /2 is a local maximum, but it still follows that there can 
be no local maximum in (£, 1/2). 

To summarise: if k > 2 then 77 has exactly one stationary point £ G (0, 1 — 1 /k), a local minimum. 
If k = 2 then there is a unique local minimum £ G (0, V 2 ] but, if £ 7^ 1 /2, then 1/2 may be a local 
maximum. In either case, £ is the global minimum. 

It remains to identify the local minimum £ of 77 to a close enough approximation. Using (|56p . the 
condition that rj'(x) < is 

g'(x){ln(k(l - x)) + f'(x)(x - g(x)/g'(x))) > 0, x G (0, 1 - l/k). 
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We have shown that fix) < and g'(x) < for x € (0,1 — 1 /k), so the condition rj'(x) < is 
equivalent to 

g(x) In (fc(l - a;)) In (k(l - x)) 

We will now use (j62|) to show that £ is approximately l//c r_1 , except for the cases k = 2, r = 3, 4. 
(If r = 2 then £ = 1/A; r_1 exactly.) This will enable us to determine the value of c T ^ and establish 



that part (c) of Theorem 1 1 . 1 1 holds . 



3.3 The case k = 2 

We will first examine the case k = 2 in more detail. We must determine whether x = 1 /2 is 
a local minimum or maximum of r/. If it is a local minimum, then it is the global minimum. 
Otherwise, there is a unique local minimum ^ 6 (0, 1/2). To resolve this, we must examine 77 in the 
neighbourhood of x = 1 /2. We show in Lemma 14.121 that 1 /2 is a local minimum of 77 for 2 < r < 4, 
but is a local maximum if r > 5. Thus, for r = 2, 3, 4, the global minimum is £ = 1 /2. (Note that 
we include the case r = k = 2 here, though ultimately it plays no part in our analysis.) Hence 
from f|52[) and (|6Q|) we have that for r = 2, 3, 4, 



(2 r " 1 - l) 2 2 1 '- 1 (2 r ~ 1 - I) 2 
r(r — 1) r(r — 1) 



Cr,2 = t^ZI ZJZ TT = Z7Z — ■ ( 63 ) 



Specifically, 

c 2j2 = 1/2 = 0.5, c 3 , 2 = 3 / 2 = 1-5, c 4 ,2 = 49 /i2 ~ 4.0833. (64) 
Now u r: i = for all r, and ?i ri 2 = 2 r_1 In 2, so 

ti 2 , 2 ^ 1.3863, ii 3i2 ^ 2.7726, u 4 ,2 ^ 5.5452. (65) 

It follows that 

u r,i < c r ,2 < u r ,2 fo r r = 2, 3, 4, (66) 

as required. (We cannot use this result in Theorem 11.11 when k = r = 2, since there is no sharp 
threshold in this case.) 

In the cases k = 2, r > 5, there is a local minimum £ G (0, 1 /2), so the optimisation has similar 
characteristics to k > 3. We consider these cases in Section [3.51 below. 



3.4 The case r = 2 

We will consider the case r = 2 separately, since 77 can be minimised exactly in this case. The 
results given in this section were obtained by Achlioptas and Naor in [3], by making a nonlinear 
substitution in r\. We can derive their results more simply, since we know that r/ has a unique 
minimum. We have 

g (x) = (i-^ + tttt- 

2k f k-l \ 
sog(x) = —(x-—). 



k-1 



k - 1\2 



X 



22 



It follows that 



= l(x+(k-l)/k). 

Hence (|62p implies that x minimises r\ if and only if 

k-1 2ln(k{l-x)) 



x + 



k ' ln((fe- 1)(1 -x)/x) ' 



It is easily verified that x = 1/k satisfies this equation, and hence is the unique minimum of r] in 
[0,1-i/fc]. 

We have dealt with the case k = 2 in the previous section, so we now assume that k > 3. Then 

. , f(l/k) (fc-l)ln(fc-l) 

mm ri(x) = ) '- ' = - / — 

*6[o,i-yfc] IK ' g(l/k) k-2 

and hence 

(k ~ I) 2 (fc-l) 3 ln(fc-l) 

C2 < fc = —k—^ 1/k) = Hk—J) • (67) 

It follows that for k > 3, 

U2,k-i = (k - 1) ln(/s - 1) < c 2 ,A: < fcln/c = u 2 ,fc, (68) 

as required. 



3.5 The general case 

We now consider the remaining cases r>3or/c = 2,r>5. We will do this by finding values 
w,y G (0, 1 — l /k) such that rj'(w) < and rf {y) > 0. That is, w satisfies (|62|) . but y does not. 
The uniqueness of ^ then implies that w < £ < y, and we will use this to place a lower bound on 
V(0- We will achieve this for all pairs r, k except for a small number, and we will solve these few 
remaining cases numerically. 

To simplify the analysis, we will exclude some cases initially. Thus we assume below that 

k = 2, r > 9 or k = 3, r > 4 or k > 4, r > 3. (69) 
By Lemma 14.13} the inequality 

r 2 {k + 2)/k r < 1 (70) 

holds whenever (I69j) holds. 

First we set x = w in ()62f) . where w = (k — l)/k r . Note that u> < 1/r 2 , from (jTOf) . Using 
Lemmas 14.31 14.41 and 14. 6| we have 

r In (A;(l — w)) rlnk — rw 1 — w/lnk 

ln((k-l)(l-w)/w) < rink- Zw/2 ~ 1 - 3w/(2rlnfc) 

<( 1 -i)( 1 + ^)< 1 -( 1 -')^^- <-) 

since r > 3. 
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Using Lemma 14.71 we have 



w r 1 Aru; (A; - l)r + A; 

= (i-«0 + (fc _ ir -i -F^ ^ 1 - rw -fc3i = 1 - k-l w > 

~ 9 ' {W) " (1 " ^f" 1 " 77^^ < (1 " vf- 1 < 1 



r v ' (Ar-l) 7 '- 1 ~ v ' ~ l + (r-l)w 

So we have 

rg(w) ( (k — l)r + k \ , . N 

ru; 77 — r- > rw + 1 w (1 + (r — l)u>) 

<?'(itf) V fc — 1 / 

V fe - 1 v fc— 1 fc r / 

/ 2fc - 1 r 2 \ 

> 1+ y-T^r-k^r 

1 r 2 



and the right hand side is bounded below by 1 whenever 



1 r 2 

+ 7-T<r-2. (72) 



k - 1 A^" 1 



We may easily show that the left hand side of (172ft is decreasing with r for r > 3, and it is clearly 
decreasing with k > 2. The right hand side is independent of k and increasing with r. Now (|72p 
holds by calculation when (At, r) G {(2,5), (3,4), (4,3)}. Therefore (f72j) holds for all (A;, r) which 
satisfy ([69]) . and combining this with (fTT|) shows that u; satisfies ([62]) . as desired. 

We now set x = y in (|62p . where y = {k + 2)/k r . We have ry < 1/r from (|70|) . Then, using 
Lemmas 14.31 and 14.41 we have 

rln(A;(l-y)) lnF-3ry/2 ^ lnA; r -3ry/2 



ln((A;-l)(l-y)/y) In Ar r + In (l - 3/{k + 2)) + ln(l - y) In k r - 3/(k + 2) 

3/(fc + 2) - 3ry/2 3/(fc + 2) - 3n//2 

lnA; r -3/(A; + 2) rlnfc 

Using Lemma 14771 

9(V) = (1 " y) r + 71 4i - 7^1 < 1 - ry + ±(n/) 2 + "' A '" 



(A: - l) 1 ' -1 fc r_1 - * 2V y/ (fc-1)'- 1 At + 2 
12. y K 



= l -V-^ ry -Jk^W ZT + k^)^ 
-g'{y) _ Nr-i y r_1 ^ n / ^ y r_1 



(A: - l)^- 1 - * v ^ a (Ar-l)^ 1 



l-(r-l+ „_ g ^ r _ 1 )y. 



Now y r /(A; — l) r < 1 for k,r > 2 and ry < 1/r < 1/2, using Lemma 14.61 Therefore 



<?'(y) " V {k-iy-iJ* V (fc-i) 
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< 1 + { r - 1 + J k ~^ + 2r2y ) y - 

Thus 



So r/'(y) > if y does not satisfy (|62|) ; that is, if 

3/(fc + 2) - 3ry/2 / 5r 2 y (1 + y)y- 2 
> r — H h 



rlnfc V 2 (fc-1)''- 1 fc + 2 

Dividing by y and rearranging gives the equivalent condition 

3fc r 2 3 5r 2 y (1 + y)y r ~ 2 

> r-2 + — + — + —^+ \ tm . (73) 



r(fc + 2) 2 lnfc k + 2 2 lnfc 2 (k - l) r_1 

From Lemma 14.131 we have r 2 y < 1 and that y = {r 2 y)jr 2 is decreasing with both r and k. Since 
y < 1, it follows easily that (1 + y)y r ~ 2 /(k — l) r_1 is decreasing with r and k. We may now check 
numerically that (1 + y)y r ~ 2 /(k — l) r_1 < 1/50 for all k, r satisfying ([69]) . It follows that ([73j) is 
implied by the inequality 

3fc r ^ 1 0.52 2 3 

> 1 + — + r77^ + ^7TT- ( 74 ) 



r 2 (fc + 2) 2 lnfc ~ r r(fc + 2) 2rlnfc' 

We show in Lemma 14.141 that, if (I74p holds for some r > 3, fc > 2, then it holds for any r', A;' such 
that r' > r, k' > k. We may verify numerically that (|74p holds for the following pairs r, k. 

k = 2, r = 9, = 3, r = 6, A; = 4, r = 5, fc = 5, r = 4, fc = 15, r = 3. 

Thus it holds for all pairs r, fc such that 

fc = 2, r > 9, fc = 3, r > 6, fc = 4, r > 5, fc E {5, . . . , 14}, r > 4, fc > 15, r > 3. 

Let us call these the pairs (fc, r) regular, with the remaining nineteen pairs being irregular. We deal 
with the irregular pairs below by numerical methods. 

First we continue our focus on regular pairs. For such pairs we have argued that (fc — l)/fc r < £ < 
(fc + 2)/fc r and hence, using Lemmas 14.51 and 14.71 

/(0 = lnfc-£ln(fc-l)+£ln£ + (l-£)m(l-£) > lnfc - (rlnfc + 1)£ , 

9(0 = (i-£) r +r/(*-i) r - 1 -iA r - 1 < v(i+ re). 

Hence, using Lemma |4.15| 

r/(f) > (lnfc- (rlnfc + l)£)(l + r£) = lnfc - f - r(r lnfc + 1)£ 2 > lnfc-2£. 
From (I52p . we can now determine 



(^-!) 2 ^ 2 ( fc + 2 n . fU -x 
k r 



c r , k > V ' (lnfc --4^) > (fc r - 1 -2)lB(fc-l), 
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using Lemma EH Now k^ 1 > (k - l) r_1 + (r - l)(jfe - l) r ~ 2 > (fc - If" 1 + 2 for r > 3, k > 2, 
which shows that 

u r>k -i = (k- l) r-1 ln(k - 1) < c rjfc 

for all regular pairs. 
We also have 

Cr,k < ^ ~ 1<® = (fcr fc L"l 1)2 £-1-1 = ^ r_1 " 1)lnfc < kr ' Hnk = U ^ (75) 

as required, and this holds for all r, k > 2. 

Next we consider irregular pairs and use (|62p to bound £ numerically, by bisection. This is quite 
straightforward, since we know that £ G (0,1 — V fe ) is unique. The resulting values of c r) fc are shown 
below, along with the corresponding values of u rj k~i and u T ^- 



k 


T 


1Lrr> U 1 








5 


0.0000 


9.8771 


11.0904 




6 


0.0000 


21.2990 


22.1807 


2 


7 


0.0000 


43.7678 


44.3614 




8 


0.0000 


88.3486 


88.7228 




3 


2.7726 


8.1566 


9.8875 


3 


4 


5.5452 


27.9595 


29.6625 




5 


11.0904 


87.4703 


88.9876 


4 


3 


9.8875 


20.0491 


22.1807 




4 


29.6625 


86.6829 


88.7228 


5 


3 


22.1807 


37.8417 


40.2359 


6 


3 


40.2359 


61.8958 


64.5033 


7 


3 


64.5033 


92.5637 


95.3496 


8 


3 


95.3496 


130.1457 


133.0843 


9 


3 


133.0843 


174.9034 


177.9752 


10 


3 


177.9752 


227.0688 


230.2585 


11 


3 


230.2585 


286.8499 


290.1453 


12 


3 


290.1453 


354.4353 


357.8266 


13 


3 


357.8266 


429.9977 


433.4764 


14 


3 


433.4764 


513.6960 


517.2552 



By inspection, u Ty k-i < c T .k < Ur,k for all irregular pairs. 

We have already proved most of Lemma 12.21 and we complete the task below. 

Proof of Lemma \2.SX The above analysis shows the existence of constants c T ^ for all r, k > 2 
such that F has a unique maximum at J whenever c < c rk . Combining the numerical results for 



irregular pairs with (|64p . (|65p . (|66|) and (|68p establishes part (c) of Theorem II. 1\ as required for 
Lemma 12.21 

It remains to prove that for all k, r > 2 we have 

(k r - 1 - l) 2 



r (r 
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This follows from (|63p if k = 2 and r = 2, 3, 4. In all other cases we have c r ^ < (k 1 *' 1 — 1) In k, from 
(I75p . Furthermore, it follows from Lemma 14.181 that r(r — 1) In k/{k r ~ 1 — 1) < 1 whenever fc > 3, 
r>2, or/c = 2,r>5. This completes the proof of Lemma 12.21 □ 

3.6 Asymptotics 

We have given precise bounds on c r ^, but if we require only asymptotic estimates as r — > oo and/or 
k — > oo, the following simplified analysis suffices. 

Remark 3.4. When we write "r — > oo and/or k — > oo", this is not to be interpreted as "r(n) — > oo 
and/or k(n) —> oo", but merely as "r and/or k are arbitrarily large constants". Otherwise, we 
cannot use Theorem ll.3l to establish the existence of a sharp threshold between c r ^ and u T ^. This 
is the approach to asymptotic estimates taken, for example, in [3]. 

We will use (|4"5j) to improve the estimate of c r ^ asymptotically, as discussed in Remarks 13.21 and l3.31 
First let us consider the maximum possible improvement that we might be able to achieve. 

From Remark 13.31 we know that the maximum value of z in f)19j) — f|23 |) cannot be smaller than that 
given by taking t = k in (|48|) . Thus we may bound the possible increase in c r ^ as follows. Since 
giP) < 1 - l/k r - 1 and t < k, it follows from (09]), using LemmaEO] that i? < l/(fc r_1 - 1) in (1501) . 
Thus X^£o^V(^ + l)! ^ 1 + i?/2 + 0(i? 2 ) in (foTJ]) . Therefore we can increase c r ^ asymptotically by a 
factor at most 1 + l/(2fc r ) + 0(l/fc 2r ~ 2 ). Since c r ^ < u T) k = k r ~ l In k, the additive improvement 
to c T) k from fully optimising (|19 p - (|23p is at most 1 /2 In k + 0(ln k/k r ^ 1 ). Hence we cannot improve 
c r) fc asymptotically by more than an additive term 1/2 In A;. 

Now let us consider what improvement we can rigorously justify. From Remark 13.21 we know that 
we can take t = 2 in ([48]). Let k = 4(k - l)/k r , and 1Z = {x G E : 1/F < s < k}. We proved in 
Sections I3.3H3. 51 that the minimum of 7](x) for x € [0, 1 — 1 /k] lies in [(/c — l)/k r , (k + 2)/A; r ] C 71 for 
all k, r > 2. However, all we require here is the fact that rj has a unique minimum in [0, 1 — l /k], as 
shown in Section 13.21 Now we may approximate 

f(x) = Ink — x ln(/c — 1) + xlnx — x + 0{x 2 ) , 
g (x) = l-rx- l/k 1 '' 1 + 0(r 2 x 2 ). 

Hence, using Lemma 14.61 and noting that — lnx = 0(r In k) since x > l/k r , in 1Z we have 

r](x) = ( In k — xln(A: — 1) + xlnx — x + 0(x 2 )) (l + rx + l/k 1 *' 1 + 0(r 2 x 2 )) 
= (1 + 1/A; r_1 )ln + x(rlnfc - ln(A: - 1) - 1) + xlnx + 0(r 2 x 2 Ink) . 

Therefore, let (p be the function defined by 

(f(x) = (1 + l/k 1 *' 1 ) In k + x(r In k — ln(k — 1) — 1) + xlnx . 

We have seen that <p approximates ij. Now 

(p'(x) = (rink- ln(k - 1) - 1) + 1 + lnx = lnx - In ((k - l)/k r ) , 
ip"{x) = 1/x > 0. 

Thus (f(x) is minimised at ^ = (k — l)/k r G 1Z, as expected. We can write 

ip(x) = (l + l/A: r " 1 )lnA;-x + xln(x/|). (76) 
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In particular, 

<p(g) = (l + l/F-^lnfc-e- 
Hence, reinstating the error term in (|5ip . we may take 

(k r ~ 1 - l) 2 / / 1 \ , k-1 /r 2 ln£;\\ 

= (.--l)ln*-^-o(^). (77) 

Since k = 4£, using ([TS]) we have, 

<p(.K) - <p(,£) = (|-k) + /c1h(k/|) = -3£ + 4fln4 > 2.5 1 = 2.5 (A; - l)/fc r . 

Therefore, since r\ has a unique minimum in [0, 1 — l /k], we have 

> <p(i) - 0(r 2 lnk/k 2r - 2 ) (x < «) 

V(x) > <p{i) + 2.5(k-l)/k r -0(r 2 lnk/k 2r ~ 2 ) (x > k) . 

We have g(x) = 1 — 0(r /k r ^ 1 ) when x < k, and hence = 2/k r — 0(r/k 2r ~ 1 ), taking t = 2 in ([39]) . 
Thus the factor (e tf - in (J50J is 1 + l//c r - 0(r/& 2r ~ 1 ). This is effectively the maximum value 
of (e^ — for x G [0, 1 — !/fc] and (e 15 — is effectively constant for x < k. Thus 

mm( V (x)^-^) > ^(e)(l + l/A: r )-0(r 2 lnA:/A; 2r - 2 ) 

X<K \ V J 

= + In k/k r - 0(r 2 In £;/A; 2r - 2 ) , 
since <p(£) = lnk + 0(ln k/k^ 1 ). Also, > in [0, 1 - i/fe], so 

minf^a?)^-^) > + 2.5(fc - 1)/F - 0(r 2 In fc/A; 2 ^ 2 ) (78) 
> + lnk/k r - 0(r 2 In k/k 2r - 2 ) , 

for any fc, r > 2, provided /c r is large enough. Thus, after multiplying the right side of (178p by 
(/c r_1 — 1) 2 /A; r '~ 1 , the additive improvement in c ri fc is \nk/k-0(r 2 lnk/k r - 1 ). Applying this to ([77]) . 
we have 

C,* = ^- 1 lnA;-^-(l + lnA;)-0(^^). (79) 
Substituting k = 2 in (T79]) . 

Cr . j2 = 2 r ~ 1 In 2 - 1/2(1 + In 2) - 0(r 2 /2 r ) , 

the result obtained by Achlioptas and Moore [3] for 2-colouring r-uniform hypergraphs. The case 
r = 2 (colouring random graphs), studied by Achlioptas and Naor [3], is discussed further below. 
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Remark 3.5. The best lower bound on u T) t is it r ,fc = Ur,k — V^ln/c from Remark 12. 1| so there is a 
gap 

fc-2 , /r 2 lnfc 



n r , fc - c r , fc = + In fc + O v fcr _ 1 

Asymptotically, this gap is always nonzero, though extremely small compared to c T ^ or u r ^. It 
is independent of r (up to the error term), and grows slowly with k. It is minimised when k = 2 
and r — > oo. The existence of this gap merely indicates that the second moment method is not 
powerful enough to pinpoint the sharp threshold. We know from Theorem 11.31 that the threshold 
lies in [c rj k,u r ^], although it is possible that it does not converge to a constant as n — > oo. Note 
that if we could obtain the maximum possible correction l /2lnk, as discussed above, then the gap 
would be approximately (k — l)/k, and hence uniformly bounded for all k, r > 2 except k = r = 2. 



Observe that the asymptotic estimate of c r ^ given in (|79p is not sharp in one case, namely when 
r = 2 and k — > oo. Here the error in (|79|) is 0(lnk/k), so we have not improved (|77p . Since this is 
the important case of colouring random graphs, we will examine it separately. 

From ()67p . we know that the bound on c<i,k from minimising n is precisely 

1)3 ■ln(fe-l) = k\nk -^±{1 + \nk) - \ - 0{ 1 ^-) . (80) 



k{k-2) y 1 k '2k \ k 2 

The right side of ([79]) is </?(£) + 0(lnk/k), so (fTHj) still implies that, when k is large enough, we need 
only consider i?(x) for x € 1Z. It follows, as above, that the factor (e —!)/'&= 1 + 1/k 2 — 0(l//c 3 ). 
Thus the additive improvement in c^^k is lnk/k — 0(lnk/k 2 ). Adding this to (]80p . we have 

fc-2, , 2fc-l ^/lnfc\ . . 

c %k = kink-— ]n* + - ljr -0( ir ) ) (81) 



which marginally improves ()68j) asymptotically. Note that, taken together, (1791) and (1811) exhaust 
the possibilities for the manner in which r and/or k can grow large. 
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4 Appendix: Technical lemmas 



Lemma 4.1. G 6 Q*(n,r,cn) has at least (k — 1) isolated vertices a.a.s.. 



Proof. Define m = \cn\ and let Y(v) be the number of isolated vertices in G, determined by v. 
The mr entries of v are uniform on [n], from which it follows that E[Y] = n(l — l/n) mr ~ ne~ CT . 
Also, the entries of t> are independent, and arbitrarily changing any single entry can only change 
Y(v) by ±1. Thus we may apply a standard martingale inequality [TH Corollary 2.27] to give 



Pr (Y < lne~ cr 



r /12cr 



for large n. Thus G has fi(n) isolated vertices a.a.s., from which the result follows easily. 
Lemma 4.2. Suppose that M is a p x p matrix of q x q blocks, such that 



□ 



M 



2B 


B ■■ 


■ B 


B 


2B ■■ 


■ B 


B 


B ' 


. B 


B 


B ■■ 


■ 2B 



ppqxpq 



where B 



2 1 
1 2 



Then det(M) = (p + + 1)p. 



Proof. We have, by adding and subtracting rows and columns of M, 



det M = det 



2B 


-B •• 


• -B 




\p+l)B 


•• 


• 


B 


B •• 







B 


B ■■ 


• 








= det 








B 










B 


' 


• 


B 


•• 


• B 




B 


•• 


• B 



= det ((p + l)B) (det Bf' 1 = (p + l) 9 (det Bf. 

We can use the same transformations to compute det-B, replacing B by the lxl matrix (1) in the 
argument. We obtain det B = (q + 1) l 9 " 1 = q + l. Hence det M = (p + l) q (q + l) p . □ 

Lemma 4.3. ln(l + z) < z for all z > —1. 



Proof. Let <j)(z) = z — ln(l + z), which is strictly convex on z > —1, since ln(l + z) is strictly 
concave. Also (f>'(z) = 1 — 1/(1 + z), so (p is stationary at z = 0, and this must be its unique 
minimum. Since </>(0) = 0, we have 4>(z) > for all z > — 1, and </>(z) > if z 7^ 0. □ 

Lemma 4.4. ln(l -z)> -3z/2 for all < z < 1/2. 

Proof. Let 0(z) = ln(l — z) + 3z/2. Then is strictly concave on [0, 1), since ln(l — z) is strictly 
concave. Also 4>'{z) = —1/(1 — z) + 3 /2, so is stationary at z = 1/3, and this must be its unique 
maximum. Now </>(0) = 0, and we may calculate (fti 1 /?) > 0, so 4>(z) > for < z < 1 /2. □ 

Lemma 4.5. For all z G (0, 1), (1 — z) ln(l — z) > — z and (1 — \z) ln(l - z) < —z. 
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Proof. We have 



(1 - z) ln(l - z) = -z + V] Z > -z, 



OC 



( 1 -i')Mi-») = -»-E^<-* ° 

Lemma 4.6. 1 + z < 1/(1 - z) < 1 + z + 2z 2 < 1 + 2z for all < z < 1/2. 

Proof. The first inequality is equivalent to z 2 > if z < 1. The second inequality is equivalent to 
z < 1/2. The third follows trivially from the second. □ 

Lemma 4.7. For p G N, z G [0, 1], 1 -pz < (1 - zf < 1 -pz + \{pz) 2 . Also (1 - z) p < 1/(1 + pz). 

Proo/. Let ^(z) = (1 - zf - 1 + pz. Then fa(0) = and ^(z) = p(l - (1 - z)^ 1 ) > if 
z G [0, 1], giving the first inequality. Let 4>2(z) = 1 — pz + \ipz) 2 — (1 — z) p . Then </>2(0) = and 
0' 2 (z) = — p+p 2 z+p{l — z) p ~ l > — p+p 2 z+p(l — (p— l)z) = pz > 0, by the first inequality, giving the 
second. For the third inequality, using Lemma l4T3l we have (1— z) p < e~ pz = l/e pz < l/(l+pz). □ 

Lemma 4.8. Let r](x) = f{x)/g{x) for x G [0, 1 — 1/k], where 

f{x) = In A; - x \n(k - 1) + (1 - x) ln(l - x) + x \nx, g(x) = (1 - x) r + x r /(k - l) r ~ a - l/k r ~ 1 . 
Then 

k r - l lnk , . , k r - x 

v(Q) = 7731 — 7: lim „ y( x 



k r - x -V x-tl-l/k 'r(r-l)' 

lim n'(x) = —00, lim rf(x)= ^— — . 

x^o x-+l-l/k r(k — 1) 

Proof. The first statement is immediate. Recall the calculations of /Section 13.21 Using L'Hopital's 
rule [15], 



lim r)(x) 
x-H-l/k 



f"(x) 
g"{x) 



k 



r-l 



1-1/k r(r-l)' 



Next, 



^/z n _ g{x)f'{x) - f(x)g'(x) = f'jx) - n(x)g'(x) 
g(x) 2 g(x) 

As x — > 0, all quantities in ([82]) are finite, except f'(x) — > —00. Since g(0) > 0, we have rf(x) — > —00 
as x — >• 0. 

For the final calculation note that asx = 1— 1/&, the numerators and denominators in the expression 
for r/(x) are all zero. Hence applying L'Hopital's rule again gives 

lim ri lx)= lim m^>m = lim /'w-v-(i-'A) 9 'w 

x-ti-i/k x-ti-i/k g(x) x-*\-i/k g[x) 

Um r(x)- Tya-iA)^^) 

x-+\-i/k g"(x) 
(k - 2)k r 

~ r(k-l) ' 

using the values of f"(l - 1/fc), /'"(l - 1/fc), s"(l - 1/k) and s"'(l - 1/fc) calculated in Section[321 □ 
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Lemma 4.9. Let k > 2 and r > 2. If r < 2k then the equation 



(k-lf- 1 Ar(r-l) 

has at most two solutions for x in [0, 1 — 1 /k\. Otherwise r > 2k + 1 and the above equation has at 
most two solutions for x in [0, 1 — 1 /k] whenever A < Ao, where 

1 , ^({r-2)2 r ' 1 1\ 
— = r (r - 1 ^ '- + — . 

A V ' V r r k r J 

Proof. Let 9{x) = x(l — x) r_1 and define k, uj by 1/k = (k — l) r_1 and 1/uj = Ar(r — 1). We wish 
to investigate the number of solutions of 4>{x) = oj, where 

0(z) = 6(X) + K0(1-X). 

Differentiating gives 

<f/(x) = e'(x) - k0'(i - x), 4>"(x) = e"( x ) + «0"(i - x). 

Thus the stationary points of (f> are the solutions of 9'{x) = k9'(1 — x). We may calculate 



e'(x) = (1- x) r ~ 2 (l-rx), 

9"(x) = -(r - 1)(1 - x) r " 3 (2 - rx), 



e'(l-x) = -x r - 2 {{r- 1) -rx), 

- x ) = (r - l)x r ~ 3 ((r - 2) - rx). 



We summarise the behaviour of 4> i n [0, 1] i n the following table. Here \. means "decreasing" , f 
means "increasing" . The final column gives the maximum number of stationary points of <f> in the 
corresponding subinterval of [0, 1]. 



x G [0,1/r) 

x G [l/r,2/r) 

x G (2/r,l - 2/r] 

x €(l-2/r,l- 1/r] 

x G (1 - l/r,l] 



0(x)t 
0'(x) 4. k9'{1 - x) t 
0(x) strictly convex 

e'{x) t - x) | 

</>(x) I 




1 
1 
1 





0'(s) > 0, 0'(l - x) < 
0"(x) < 0, 8"(1 - x) > 
^(x) > 0, 0"(1 - x) > 
6»"(x) > 0, 0"(1 - x) < 
0'(x) < 0, 9'(1 - x) > 

Now <fi is analytic on [0,1], with 4>'(0) = 1 and <j>'(l) = —n. Therefore <fi' changes sign an odd 
number of times in [0,1], which implies that <j) has an odd number of stationary points in [0,1]. 
From the table it follows that 4> has either one or three stationary points. Hence 4>(x) = u has at 
most four solutions in [0,1], for any uj. 

We first consider small values of r. When r = 2 the union of the first and last subinterval is 
[0, 1] \ {V 2 }i which contains no stationary point. Hence (j) has at most one stationary point in [0, 1] 
(and it can only occur at x = 1/2). When r = 3 the union of the first, second and last subinterval 
equals [0, 1] \ { 2 / 3 } an d contains at most one stationary point. Hence 4> has at most two stationary 
points in [0, 1]. 

When r = 4, the central subinterval is empty, so <j> has at most two stationary points in [0,1]. 
However, we know that an even number of stationary points is impossible, from above. Therefore 
when r = 2, 3, 4 the function (p has at most one stationary point in [0, 1], and hence at most two 
solutions to <f)(x) = uj in [0, 1], for any fixed uj. 
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Next we assume that r > 5, which implies that all five subintervals are nonempty. Either cj) has 
one stationary point which is a local maximum, or it has three stationary points: a local maximum 
fix, a local minimum \i 2 , and a local maximum ^3, with fj,% < fi 2 < ^3- 

Let 

L x = sup{0(y) : y e [1/r, 2/r)}, L 2 = sup{0(z) : z € (1 - 2/r, 1 - l/k]. 
(We take L2 = — co if there is only one stationary point.) First we show that 

Li > L 2 . (83) 

We readily see that 

(r - iy- 1 

L x > 4>(l/r) > 0(l/r) = 1 J . 
Next we calculate an upper bound on L 2 by considering two cases. First, if 2 < r < k then 

L 2 < 0(1 - 2/r) + K 0(l/r) = 1 ( (r - 2)2^ + (^A)^ 1 ) 
(r - 2)2 r ~ 1 + 1 , (r - 2)2 r " 1 + (r/2) r 



since 0(1 — x) is maximised at x = 1 — 1/r in [0, 1]. Next, if r > k > 2 then 0(1 — x) is maximised 
when x = l/k in [0, 1 — Therefore 

(r-2)2 r_1 1 (V - 2)2 T '~ 1 + (r/2T 

L 2 < 9(1-2 r) + K0(l k) = { - + -< { - > r + W > . (84) 

r r k r r r 

Thus ([83]) holds if (r - 2)2 r " 1 + (r/2) r < (r - l) r_1 . We show in Lemma KM that this is true for 
all r > 5, so (|83[) holds for r > 5. Now (/> has at least one local maximum, so we have established 
that 4> has a local maximum fi± S [1/r, 2/r) whenever r > 5. 

We now consider whether cp has a local minimum ^2 £ (2/r, 1 — 2/r). Since there is a local maximum 
[i\ S [1/r, 2/r) we know that (f)'(2/r) < 0. Thus 4> nas a local minimum \i 2 £ (2/r, 1 — 2/r] if and 
only if c//(l - 2/r) > 0. Now 

*<1 - 2/r) = «<(! - 2/ r) - = - 

This expression is certainly nonpositive if 2 > (r — 2)/(k — 1); that is, if r < 2k. So, if r < 2fc, 
there is no local minimum in (2/r, 1 — 2/r) and it follows that [i\ is the only stationary point of <f>. 
In this case, the equation <p(x) = lo has at most two solutions on [0, 1]. 

When r > 2k + 1 > 5 we know that 4>(x) = oj has at most two solutions for all u > L 2 , using (183p . 
From (|84p we have 

(r - 2)2 r ~ 1 1 
2 r r k r 

Substituting uj = (Ar(r — 1)) _1 we find that <j)(x) = lj has at most two solutions in [0, 1 — l/k] so 
long as 

1 1 {(r-2)2 r ~ l 1 V 1 
A>- — > -(- = + — I = A . □ 



r(r — 1)L 2 r(r — 1) \ r r k r 
Lemma 4.10. For all r > 5 the inequality (r — 2)2 r_1 + (r/2) r < (r — l) r_1 holds. 
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Proof. We will show (r - 2)2 r ~ 1 < (r - l) r ~ 1 /2 and (r/2) r < (r - l) r_1 /2. 
To show (r - 2)2 r ~ 1 < (r - l) r ~72, let 71 (r) = 2(r - 2)2 r -V(r- - l) p_1 . Then 

7i(r + 1) = 2 / r-l y < l 
71 (r) r — 2 V r / 

if r > 4. Thus 71 (r) is decreasing for r > 4. Since 71 (5) = 3 /s < 1, the inequality follows. 
To show (r/2) r < (r - l) 7 " -1 / 2 ; let 72 (r) = r r /(2r - 2) r ~ 1 . Then 

72(r+l) r+1 ^r 2 — l\ r < /r 2 — l\ r ^ ^ 



72 (r) 2r — 2 V r 2 / V r 2 

if r > 4. Thus 72(1") is decreasing for r > 4. Since 72(5) = 5 5 /2 12 < 1, the inequality follows. □ 
Lemma 4.11. For k > 2 and r > 2k + 1, we have 

Proof. The left hand inequality reduces to r(r — 1) In k/(k r ' 1 — 1) < 1. In Lemma 14.181 we show 
that r(r — 1) In k/(k r ~ 1 — 1) < 1 for all > 3, r > 2, or k = 2, r > 5. Clearly this includes all 
r > 2k + 1, and so establishes the left hand inequality. 

The right hand inequality is 

(r-2)2 r - 1 1 1 
r~ r ftF < /c^ 1 ' 

which is equivalent to 7(7", fc) < 1, where 

7(r ' fc) = 2^1 (t) • 

For fixed k > 2, if r > 2/c then 

7(r + l,/c) 2/c(r-l)r r 2kr(r - 1) 2 A; 

7(7-, fc) ~ (r-2)(r + l) r + 1 ~ (r-2)(r + l) 2 _ V < ' 

if r 2 (r — 1) < (r — 2)(r + l) 2 . This is equivalent to r 2 — 3r — 2 > 0, which is true for all r > 4. Thus 
7(7*, k) is decreasing in r, so we need only establish the critical case r = 2k + 1. We have 

2fc - 1 / 2Jfe \ 2fc+1 2fc — 1 / 2A; x 2 



7(2t+ i,*) = <_(__,<!, 

if (2k - 2)(2k + l) 2 - (2k - l)(2k) 2 > 0, which is 2k 2 - 3k - 1 > 0. This holds for all k > 2. □ 

Lemma 4.12. If k = 2 i/ten x = x /2 is a /ocaZ minimum of r\ for r = 2, 3, A, and a local maximum 
if r > 5. 

Proof. We have 

In 2 + x In x + (1 — x) ln(l — x) 



77(2:) 



(1 - x) r + x r - l/2 r 



r-l 
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Substituting x = (1 — z)/2, we find 

r](z) = (l-z)ln(l-z) + (l + z)ln(l + z) 
2 r - 1 ~ (1 - z) r + (1 + z)'' - 2 

We may compute Taylor expansions, giving 

r(r-l)r](z) z 2 + z 4 /6 + 0(z e ) 



2^-1 z 2 + (r-2)(r-3)z 4 /12 + <3(z 6 ) 

= i + 2 -< r -f r - 3 ^ + o(A 

If r = 2, 3 then the coefficient of z 2 is positive, so z = is a local minimum. If r > 5 then the 
coefficient of z 2 is negative, so z = is a local maximum. However, if r = 4, the coefficient of z 2 is 
zero, so we need a higher order approximation. We compute 

3r/(z) _ z 2 + z 4 /6 + z 6 /15 + Q(z 8 ) _ 1 + z 2 /6 + z 4 /15 + Q(z 6 ) _ 6 
~1T ~ z 2 + z 4 /6 ~ 1 + z 2 /6 ~ 15 (Z } ' 

The coefficient of z 4 is positive, and hence z = is a local minimum. □ 

Lemma 4.13. The function r 2 (k + 2)/k r is decreasing in both r and k for all r > 3, k > 2. Hence 
r 2 (k + 2)/k r < 1 if 

k = 2, r > 9, k = 3, r > 4, fc > 4, r > 3. 

Proo/. Let 0(r, k) = r 2 (k + 2)/A; r . Then 

0(r + l,fc) = (r + 1) 2 < i 
4>(r, k) kr 2 

if k > (1 + 1/r) 2 . Since (1 + 1/r) 2 < ie/ 9 for r > 3, this is satisfied for all k > 2. Also 

(f){r,k + l) _ (k + 3)k r (fc + 3)fc fc 2 + 3fc . : 



(f){r,k) {k + 2){k + l) r {k + 2){k + \) k 2 + 3k + 2 
We can now check numerically that r 2 (k + 2)/k r < 1 for (k,r) £ {(2,9), (3,4), (4,3)}. □ 
Lemma 4.14. If the inequality 

3k r 0.52 2 3 
> 1 + + ^ + 



r 2 (k + 2) 2 ln/c ~ r r(fc + 2) 2rln/c 

ZioWs for some (r, fc) mi/i r > 3, k > 2, then it holds for all (r' , fc') suc/t that r' >r,k'> fc. 

Proof. The right side of this inequality is decreasing with r and fc, so it suffices to show that the 
function 4>(r, fc) on the left side is increasing. This follows since, if fc > 2, r > 3, 

<p(r + 1, fc) kr 2 ^ 



</>(r,fc) (r + 1) 2 - 

Also, if r > 3 then 

<f>(r,k + l) _ (fc + l) r (fc + 2) 2 lnfc > (fc + l) 3 (fc + 2) 2 lnfc (fc + l)lnfc 



(/)(r,fc) fc''(fc + 3) 2 ln(fc + 1) - fc 3 (fc + 3) 2 ln(fc + 1) fcln(fc + l)' 
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since (k + 1)(A + 2) > k{k + 3) for all k > 0. Now we will have 

(fc + l)]nJfe _ (fc + l)/ln(fe + l) 
fcln(fc + l) ~~ A/In A: 



> 1 



if the function j(x) = x/lnx is increasing for x > k. Since j'(x) = (lnx — l)/(lnx) 2 > for x > e, 
we have 0(r, k + l)/0(r, A) > 1 for A > 3. For k = 2 and r > 3, we may verify that 

0(r,3) _ 161n2/3y > 541n2 1 



0(r,2) 251n3V2/ _ 25 In 3 
Thus 0(r, fc) is increasing in k and r for all k > 2, r > 3, and the conclusion follows. □ 
Lemma 4.15. For all regular pairs, r(r In A + 1)^ < 1. 

Proof. We have £ < (A + 2)/A r for all regular pairs. Thus the inequality is true if 0(r, A;) < 1, where 
cj)(r,k) =r(r In k + l)(k + 2)/k r . Now 

,. r(rlnA + l) A + 2 /rmA 1\ / 1 2 
<K r > *0 = To TT^ = r -^JT + 15 7^3 + 



1.2 Lr-2 I 7,2 'l.2/\ 1.1 — 3 l.r-2 

is decreasing with A > 2 for all r > 3, since In A/A 2 is decreasing for A > 2. Also 

(r + l,*) _ 1 ( 1\ ( 1+ ML\ < 1 C 1 + iV < 5 < , 



0(r, A) A; \ r /V r In A + 1 / A; \ r / 9 

if t > 3, A; > 2. Thus 0(r, A) is decreasing with r > 3 for A > 2. Direct calculation shows that 
0(9,2), 0(6,3), 0(5,4), 0(4,5), 0(3,15) are all less than 1. Thus r(r In A + 1) (A + 2)/A r < 1 for 
all regular pairs. □ 

Lemma 4.16. For all regular pairs, In k — 2(k + 2)/k r > ln(A — 1). 

Proof. Using LemmaES lnA-ln(A- 1) = -ln(l - 1/k) > l/k > 2(A + 2)/A r , provided 2 + 4/A < 
k r ~ 2 . The left hand side of 2 + 4/A < k r ~ 2 is decreasing, and the right hand side increasing, for 
all r, k. Thus we need only determine the smallest pairs r > 3, k > 2 which satisfy it. These are 
k = 2, r = 5, A = 3, r = 4 and k = 4, r = 3, which are not regular pairs. □ 

Lemma 4.17. For all k > 1, 4(A - 1) > y/klnk. 

Proof. Using Lemma 14.31 In A = 21n\/A < 2(\/A— 1). So the conclusion is implied by 2(A — 1) > 
A — y/k, which follows from 2(A — 1) > (A — 1) for all A > 1. □ 

Lemma 4.18. r(r - l)lnA/(A r " 1 - 1) < 1 for all k > 3, r > 2, ork = 2,r> 5. 

Proof. Let 0(r, A) = r(r — 1) In A/(A r_1 — 1). Then, for A > 3, r > 2, or A = 2, r > 5 we have 

0(r + l,A) (r + l)(A r ~ 1 - 1) r + l 



< ~, ~r~r~r < 1. 



0(r,A) (r - l)(A r - 1) ' (r - l)k 



Furthermore 



(r,A + l) _ {k r ~ l - 1) ln(A + 1) ln(A + 1) / k \ r - 1 < k/hxk 



0(r,A) ((A + l) r ~ l - 1) InA InA \k + lJ ~ (A + 1)/ ln(A + 1) 
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for k > 3, from the proof of Lemma 14.141 If k = 2, r > 5 then 

</>(r,3) _ (2 r ~ 1 - l)m3 16 In 3 
4>(r, 2) ~ (3r-i - l)ln2 < 81 In 2 < 

So </> is decreasing with both r and k. Now we may calculate that 

0(3,3) = 31n3/4 < 1, 0(5,2) = 41n2/3 < 1. □ 
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